143 research outputs found

    Multi-modal joint embedding for fashion product retrieval

    Get PDF
    © 20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Finding a product in the fashion world can be a daunting task. Everyday, e-commerce sites are updating with thousands of images and their associated metadata (textual information), deepening the problem, akin to finding a needle in a haystack. In this paper, we leverage both the images and textual meta-data and propose a joint multi-modal embedding that maps both the text and images into a common latent space. Distances in the latent space correspond to similarity between products, allowing us to effectively perform retrieval in this latent space, which is both efficient and accurate. We train this embedding using large-scale real world e-commerce data by both minimizing the similarity between related products and using auxiliary classification networks to that encourage the embedding to have semantic meaning. We compare against existing approaches and show significant improvements in retrieval tasks on a large-scale e-commerce dataset. We also provide an analysis of the different metadata.Peer ReviewedPostprint (author's final draft

    Multi-modal fashion product retrieval

    Get PDF
    Finding a product in the fashion world can be a daunting task. Everyday, e-commerce sites are updating with thousands of images and their associated metadata (textual information), deepening the problem. In this paper, we leverage both the images and textual metadata and propose a joint multi-modal embedding that maps both the text and images into a common latent space. Distances in the latent space correspond to similarity between products, allowing us to effectively perform retrieval in this latent space. We compare against existing approaches and show significant improvements in retrieval tasks on a largescale e-commerce dataset.Peer ReviewedPostprint (author's final draft

    Multi-modal embedding for main product detection in fashion

    Get PDF
    © 20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Best Paper Award a la 2017 IEEE International Conference on Computer Vision WorkshopsWe present an approach to detect the main product in fashion images by exploiting the textual metadata associated with each image. Our approach is based on a Convolutional Neural Network and learns a joint embedding of object proposals and textual metadata to predict the main product in the image. We additionally use several complementary classification and overlap losses in order to improve training stability and performance. Our tests on a large-scale dataset taken from eight e-commerce sites show that our approach outperforms strong baselines and is able to accurately detect the main product in a wide diversity of challenging fashion images.Peer ReviewedAward-winningPostprint (author's final draft

    BASS: boundary-aware superpixel segmentation

    Get PDF
    © 20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.We propose a new superpixel algorithm based on exploiting the boundary information of an image, as objects in images can generally be described by their boundaries. Our proposed approach initially estimates the boundaries and uses them to place superpixel seeds in the areas in which they are more dense. Afterwards, we minimize an energy function in order to expand the seeds into full superpixels. In addition to standard terms such as color consistency and compactness, we propose using the geodesic distance which concentrates small superpixels in regions of the image with more information, while letting larger superpixels cover more homogeneous regions. By both improving the initialization using the boundaries and coherency of the superpixels with geodesic distances, we are able to maintain the coherency of the image structure with fewer superpixels than other approaches. We show the resulting algorithm to yield smaller Variation of Information metrics in seven different datasets while maintaining Undersegmentation Error values similar to the state-of-the-art methods.Peer ReviewedPostprint (author's final draft

    An ideal mass assignment scheme for measuring the Power Spectrum with FFTs

    Full text link
    In measuring the power spectrum of the distribution of large numbers of dark matter particles in simulations, or galaxies in observations, one has to use Fast Fourier Transforms (FFT) for calculational efficiency. However, because of the required mass assignment onto grid points in this method, the measured power spectrum \la |\delta^f(k)|^2\ra obtained with an FFT is not the true power spectrum P(k)P(k) but instead one that is convolved with a window function W(k)2|W(\vec k)|^2 in Fourier space. In a recent paper, Jing (2005) proposed an elegant algorithm to deconvolve the sampling effects of the window function and to extract the true power spectrum, and tests using N-body simulations show that this algorithm works very well for the three most commonly used mass assignment functions, i.e., the Nearest Grid Point (NGP), the Cloud In Cell (CIC) and the Triangular Shaped Cloud (TSC) methods. In this paper, rather than trying to deconvolve the sampling effects of the window function, we propose to select a particular function in performing the mass assignment that can minimize these effects. An ideal window function should fulfill the following criteria: (i) compact top-hat like support in Fourier space to minimize the sampling effects; (ii) compact support in real space to allow a fast and computationally feasible mass assignment onto grids. We find that the scale functions of Daubechies wavelet transformations are good candidates for such a purpose. Our tests using data from the Millennium Simulation show that the true power spectrum of dark matter can be accurately measured at a level better than 2% up to k=0.7kNk=0.7k_N, without applying any deconvolution processes. The new scheme is especially valuable for measurements of higher order statistics, e.g. the bi-spectrum,........Comment: 17 pages, 3 figures, Accepted for publication in ApJ,Matches the accepte

    Controlled Interfacial Reactions and Superior Mechanical Properties of High Energy Ball Milled/Spark Plasma Sintered Ti–6Al–4V–Graphene Composite

    Get PDF
    Ball milling process has become one of the effective methods for dispersing graphene nanoplates (GNPs) uniformly into matrix; however, there are often serious issues of structural integrity and interfacial reactions of GNPs with matrix. Herein, GNPs/Ti‐6Al‐4V (GNPs/TC4) composites are synthesized using high energy ball milling (HEBM) and spark plasma sintering. Effects of ball milling on microstructural evolution and interfacial reactions of GNPs/TC4 composite powders during HEBM are investigated. As ball milling time increase, particles size of TC4 is first increased (e.g., ≈104.15 μm, 5 h), but then decreased to ≈1.5 μm (15 h), which is much smaller than that of original TC4 powders (≈86.8 μm). TiC phases are in situ formed on the surfaces of TC4 particles when ball milling time is 10Thinsp;h. GNPs/TC4 composites exhibit 36–103% increase in compressive yield strength and 57–78% increase in hardness than those of TC4 alloy, whereas the ductility is reduced from 28% to 7% with an increase of ball milling time (from 2 to 15 h). A good balance between high strength (1.9 GPa) and ductility (17%) of GNPs/TC4 composites is achieved when the ball milling time is 10 h, attributing to the synergistic effects of grain refinement strengthening, solid solution strengthening, and load transfer strengthening from GNPs and in situ formed TiC

    Temperature-independent ferroelectric property and characterization of high-TC 0.2Bi(Mg1/2Ti1/2)O3-0.8PbTiO3 thin films

    Get PDF
    Ferroelectric property stability against elevated temperature is significant for ferroelectric film applications, such as non-volatile ferroelectric random access memories. The high-TC 0.2Bi(Mg1/2Ti1/2)O3-0.8PbTiO3 thin films show the temperature-independent ferroelectric properties, which were fabricated on Pt(111)/Ti/SiO2/Si substrates via sol-gel method. The present thin films were well crystallized in a phase-pure perovskite structure with a high (100) orientation and uniform texture. A remanent polarization (2Pr) of 77 μC cm-2 and a local effective piezoelectric coefficient d33* of 60 pm/V were observed in the 0.2Bi(Mg1/2Ti1/2)O3-0.8PbTiO3 thin films. It is interesting to observe a behavior of temperature-independent ferroelectric property in the temperature range of room temperature to 125°C. The remanent polarization, coercive field, and polarization at the maximum field are almost constant in the investigated temperature range. Furthermore, the dielectric loss and fatigue properties of 0.2Bi(Mg 1/2Ti1/2)O3-0.8PbTiO3 thin films have been effectively improved by the Mn-doping
    corecore